Enabling Personalized Long-term Interactions in LLM-based Agents through Persistent Memory and User Profiles
Westhäußer, Rebecca, Minker, Wolfgang, Zepf, Sebastian
Large language models (LLMs) increasingly serve as the central control unit of AI agents, yet current approaches remain limited in their ability to deliver personalized interactions. While Retrieval-Augmented Generation enhances LLM capabilities by improving context-awareness, it lacks mechanisms to combine contextual information with user-specific data. Although personalization has been studied in fields such as human-computer interaction and cognitive science, existing perspectives largely remain conceptual, with limited focus on technical implementation. To address these gaps, we build on a unified definition of personalization as a conceptual foundation to derive technical requirements for adaptive, user-centered LLM-based agents. Combined with established agentic AI patterns such as multi-agent collaboration and multi-source retrieval, we present a framework that integrates persistent memory, dynamic coordination, self-validation, and evolving user profiles to enable personalized long-term interactions. We evaluate our approach on three public datasets using metrics such as retrieval accuracy, response correctness, and BERTScore. We complement these results with a five-day pilot user study providing initial insights into user feedback on perceived personalization. The study provides early indications that guide future work and highlights the potential of integrating persistent memory and user profiles to improve the adaptivity and perceived personalization of LLM-based agents.
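The combination of persistent memory with an evolving user profile that the abstract describes can be illustrated with a minimal sketch. This is a hypothetical illustration, not the authors' implementation: the class names (`PersistentMemory`, `UserProfile`), the tag-overlap scoring, and the 0.5 preference boost are all assumptions chosen to show how profile data could re-rank retrieved memories.

```python
from dataclasses import dataclass, field

@dataclass
class UserProfile:
    # Evolving user profile: preferences observed across sessions.
    preferences: dict = field(default_factory=dict)

    def update(self, key, value):
        self.preferences[key] = value

@dataclass
class MemoryEntry:
    text: str
    tags: set

class PersistentMemory:
    # Long-term store queried alongside the current context.
    def __init__(self):
        self.entries = []

    def add(self, text, tags):
        self.entries.append(MemoryEntry(text, set(tags)))

    def retrieve(self, query_tags, profile):
        # Score entries by tag overlap with the query, boosted when an
        # entry also matches one of the user's stored preferences.
        def score(entry):
            base = len(entry.tags & query_tags)
            boost = sum(1 for v in profile.preferences.values()
                        if v in entry.tags)
            return base + 0.5 * boost
        return sorted(self.entries, key=score, reverse=True)
```

In a real system the tag overlap would be replaced by embedding similarity, but the point is the same: the profile contributes a user-specific signal that plain context retrieval lacks.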
Allen: Rethinking MAS Design through Step-Level Policy Autonomy
Zhou, Qiangong, Wang, Zhiting, Yao, Mingyou, Liu, Zongyang
We introduce a new Multi-Agent System (MAS), Allen, designed to address two core challenges in current MAS design: (1) improving the system's policy autonomy, empowering agents to dynamically adapt their behavioral strategies, and (2) achieving a trade-off among collaborative efficiency, task supervision, and human oversight in complex network topologies. Our core insight is to redefine the basic execution unit in the MAS, allowing agents to autonomously form different patterns by combining these units. We have constructed a four-tier state architecture (Task, Stage, Agent, Step) to constrain system behavior from both task-oriented and execution-oriented perspectives, unifying topological optimization and controllable progress. Allen grants unprecedented policy autonomy while trading off some controllability of the collaborative structure. The project code has been open-sourced at: https://github.com/motern88/Allen
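The four-tier state architecture (Task, Stage, Agent, Step) can be sketched as a nested data structure. This is a speculative reconstruction from the abstract alone; the class names and the `progress` roll-up are assumptions, and the real Allen implementation (see the linked repository) will differ.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Step:
    # Execution-oriented unit: a single action an agent takes.
    action: str
    done: bool = False

@dataclass
class AgentState:
    # An agent's local state: the steps it has committed to.
    name: str
    steps: List[Step] = field(default_factory=list)

@dataclass
class Stage:
    # Task-oriented unit: a milestone worked on by several agents.
    goal: str
    agents: List[AgentState] = field(default_factory=list)

@dataclass
class Task:
    # Top tier: the overall task, decomposed into stages.
    description: str
    stages: List[Stage] = field(default_factory=list)

    def progress(self) -> float:
        # Controllable progress: roll completion up from steps to task.
        steps = [s for stage in self.stages
                 for agent in stage.agents
                 for s in agent.steps]
        return sum(s.done for s in steps) / len(steps) if steps else 0.0
```

The nesting makes the stated goal concrete: step-level states give agents room to recombine behavior, while stage- and task-level states keep overall progress supervisable.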
VerificAgent: Domain-Specific Memory Verification for Scalable Oversight of Aligned Computer-Use Agents
Nguyen, Thong Q., Desai, Shubhang, Anwar, Raja Hasnain, Shaik, Firoz, Suryanarayanan, Vishwas, Chowdhary, Vishal
Continual memory augmentation lets computer-using agents (CUAs) learn from prior interactions, but unvetted memories can encode domain-inappropriate or unsafe heuristics--spurious rules that drift from user intent and safety constraints. We introduce VerificAgent, a scalable oversight framework that treats persistent memory as an explicit alignment surface. VerificAgent combines (1) an expert-curated seed of domain knowledge, (2) iterative, trajectory-based memory growth during training, and (3) a post-hoc human fact-checking pass to sanitize accumulated memories before deployment. Evaluated on OSWorld productivity tasks and additional adversarial stress tests, VerificAgent improves task reliability, reduces hallucination-induced failures, and preserves interpretable, auditable guidance--without additional model fine-tuning. By letting humans correct high-impact errors once, the verified memory acts as a frozen safety contract that future agent actions must satisfy. Our results suggest that domain-scoped, human-verified memory offers a scalable oversight mechanism for CUAs, complementing broader alignment strategies by limiting silent policy drift and anchoring agent behavior to the norms and safety constraints of the target domain.
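The three-stage memory lifecycle described above (expert seed, trajectory-based growth, post-hoc human fact-checking that freezes the memory) can be sketched as follows. The class name `VerifiedMemory` and its methods are hypothetical, not the paper's API; the sketch only shows the contract: learning is allowed during training, and after the human pass the memory becomes read-only.

```python
class VerifiedMemory:
    # Persistent memory treated as an explicit alignment surface.
    def __init__(self, seed_rules):
        self.rules = list(seed_rules)   # (1) expert-curated seed
        self.frozen = False

    def learn(self, candidate_rule):
        # (2) trajectory-based growth, permitted during training only.
        if self.frozen:
            raise RuntimeError("memory is a frozen safety contract")
        self.rules.append(candidate_rule)

    def fact_check(self, is_valid):
        # (3) post-hoc human pass: drop spurious or unsafe heuristics,
        # then freeze the memory before deployment.
        self.rules = [r for r in self.rules if is_valid(r)]
        self.frozen = True
```

The key design point is that the human correction happens once, and the frozen result constrains all subsequent agent actions, which is what limits silent policy drift.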
ALAS: A Stateful Multi-LLM Agent Framework for Disruption-Aware Planning
Chang, Edward Y., Geng, Longling
Large language models (LLMs) excel at rapid generation of text and multimodal content, yet they falter on transaction-style planning that demands ACID-like guarantees and real-time disruption recovery. We present Adaptive LLM Agent System (ALAS), a framework that tackles four fundamental LLM deficits: (i) absence of self-verification, (ii) context erosion, (iii) next-token myopia, and (iv) lack of persistent state. ALAS decomposes each plan into role-specialized agents, equips them with automatic state tracking, and coordinates them through a lightweight protocol. When disruptions arise, agents apply history-aware local compensation, avoiding costly global replanning and containing cascade effects. On real-world, large-scale job-shop scheduling benchmarks, ALAS sets new best results for static sequential planning and excels in dynamic reactive scenarios with unexpected disruptions. These gains show that principled modularization plus targeted compensation can unlock scalable and resilient planning with LLMs.
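The history-aware local compensation that ALAS uses to avoid global replanning can be sketched minimally. This is an illustrative assumption, not the ALAS implementation: the `PlanAgent` class and its methods are invented names, and real compensation logic would consult the coordination protocol rather than a flat action list.

```python
class PlanAgent:
    # Role-specialized agent with automatic state tracking
    # (addressing deficit (iv): lack of persistent state).
    def __init__(self, role):
        self.role = role
        self.history = []   # executed actions, tracked automatically

    def execute(self, action):
        self.history.append(action)

    def compensate(self, failed_action, alternative):
        # History-aware local compensation: replace only the affected
        # action in this agent's history instead of replanning the
        # whole schedule, containing cascade effects.
        if failed_action in self.history:
            i = self.history.index(failed_action)
            self.history[i] = alternative
            return True
        return False  # disruption outside this agent's scope
```

Because compensation is local to one agent's tracked state, a machine breakdown in a job-shop schedule touches only the jobs routed through that machine, not the full plan.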
Joint Moment Retrieval and Highlight Detection Via Natural Language Queries
Luo, Richard, Peng, Austin, Yap, Heidi, Beard, Koby
Video summarization has become an increasingly important task in the field of computer vision due to the vast amount of video content available on the internet. In this project, we propose a new method for natural-language-query-based joint video summarization and highlight detection using multi-modal transformers. This approach uses both visual and audio cues to match a user's natural language query and retrieve the most relevant and interesting moments from a video. Our approach employs multiple recent techniques used in Vision Transformers (ViTs) to create a transformer-like encoder-decoder model. We evaluated our approach on multiple datasets such as YouTube Highlights and TVSum to demonstrate the flexibility of our proposed method.
How do we know AI is ready to be in the wild? Maybe a critic is needed
Mischief can happen when AI is let loose in the world, just like any technology. The examples of AI gone wrong are numerous, the most vivid in recent memory being the disastrously bad performance of Amazon's facial recognition technology, Rekognition, which had a propensity to erroneously match members of some ethnic groups with criminal mugshots to a disproportionate extent. Given the risk, how can society know if a technology has been adequately refined to a level where it is safe to deploy? "This is a really good question, and one we are actively working on," Sergey Levine, assistant professor with the University of California at Berkeley's department of electrical engineering and computer science, told ZDNet by email this week. Levine and colleagues have been working on an approach to machine learning where the decisions of a software program are subjected to a critique by another algorithm within the same program that acts adversarially.
AI analytics & Edge compute just accelerated, now what will innovators do with it?
Do not take the Intel portfolio for granted. Sure, Intel products are present everywhere in our digitalised world. But this company is way more than silicon, hardware, and software. Not long ago, Intel introduced customisable silicon (such a win for their customers) and rapid-deployment options like Intel Select Solutions pre-verified configurations of hardware and software. Now, the conversation has turned to the built-in AI acceleration on the newest 3rd Gen Intel Xeon Scalable processors; quite the incredible AI-infused, data-intensive digital solution.
Storage for AI/ML Applications Plays a Key Role at Flash Memory Summit 2020
Virtual Flash Memory Summit (FMS), the world's premier flash memory conference and exposition, announces a major program track on Storage for Artificial Intelligence and Machine Learning (AI/ML) Applications. The new track features talks on storage strategies, model training, workloads, NVMe and logical volumes, persistent memory, software-defined architectures, and accelerating the GPU data path. It also includes panels on model scalability and long-term horizons, plus a keynote by Geoffrey Burr, Distinguished Researcher at IBM Almaden Research Center. Virtual Flash Memory Summit 2020 will be held on November 10-12 and expects to draw more than 6,000 attendees. AI/ML applications require vast amounts of low-latency, high-throughput flash storage.
EETimes - Memory Technologies Confront Edge AI's Diverse Challenges
With the rise of AI at the edge comes a whole host of new requirements for memory systems. Can today's memory technologies live up to the stringent demands of this challenging new application, and what do emerging memory technologies promise for edge AI in the long term? The first thing to realize is that there is no standard "edge AI" application; the edge in its broadest interpretation covers all AI-enabled electronic systems outside the cloud. That might include the "near edge," which generally covers enterprise data centers and on-premise servers. Further out are applications like computer vision for autonomous driving.
Oracle Introduces Exadata X8M
SAN FRANCISCO, September 17, 2019 -- Oracle Exadata Database Machine X8M, available today, sets a new bar and changes the dynamics of the database infrastructure market. Exadata X8M combines Intel Optane DC persistent memory and 100 gigabit remote direct memory access (RDMA) over Converged Ethernet (RoCE) to remove storage bottlenecks and dramatically increase performance for the most demanding workloads such as Online Transaction Processing (OLTP), analytics, IoT, fraud detection, and high frequency trading. "With Exadata X8M, we deliver in-memory performance with all the benefits of shared storage for both OLTP and analytics," said Juan Loaiza, executive vice president, mission-critical database technologies, Oracle. "Reducing response times by an order of magnitude using direct database access to shared persistent memory accelerates every OLTP application, and is a game changer for applications that need real-time access to large amounts of data such as fraud detection and personalized shopping." Exadata X8M helps customers perform existing tasks faster and accelerates time-to-insight, while also enabling deeper and more frequent analyses.